Can one hear the 'sound' of a growing network? We address the
problem of recognizing the topology of evolving biological or social
networks. Starting from percolation theory, we analytically prove a linear
inverse relationship between two simple graph parameters that holds for
all growing power-law graphs: the logarithm of the average cluster size
and the logarithm of the ratio of the number of edges in the graph to the
theoretical maximum number of edges for that graph. The result establishes
a novel property of evolving power-law networks in the asymptotic limit of
network size. Numerical simulations as well as fitting to real-world citation
co-authorship networks demonstrate that the result holds for networks of
finite size and provides a convenient measure of the extent to which an
evolving family of networks belongs to the same power-law class.
Mark Kac posed the question "Can one hear the shape of a drum?" (1),
asking whether one could infer the geometric shape of a drum from its vibrational
modes. Here we explore a similar question about the evolution of complex
networks. The structure and properties of complex networks (2-4), such as the
Internet (5, 6), citation networks (7), and biological interaction networks such as
metabolic (8), protein-interaction (9), and disease-gene networks (10),
have been investigated intensively over the past decade. Many of these
real-world networks belong to the power-law family, where the number of nodes with
k neighbors, N(k), scales as a power law of the form $N(k) \sim k^{-\beta}$, where
$\beta$ is the scaling coefficient (11). Several mechanisms by which power-law
networks arise through evolution were proposed by considering the structure of
the real-world network as a target and by constructing models of network growth
that produce such a network given an arbitrary seed graph as the initial condition
(12-15). An important problem that arises in evaluating models of network
evolution and in testing these against real data is the need to definitively
characterize experimentally determined networks as belonging to the power-law
family (16). Here we consider a related problem: given a time series of network
structure data, can one deduce whether the collection of networks belongs to the
same power-law class?

In percolation theory (17), the vertices of a lattice are "occupied" or
connected with a certain probability $p$, and the emergence of connected clusters
is studied as a function of $p$. Of interest is the existence of a path that would
allow a liquid, as it were, to percolate from the upper surface of the lattice to its
lower surface. Many of the canonical results in percolation theory are in the form
of power laws $W \propto (p - p_c)^{k}$, where $p_c$ is a constant value denoting the
critical probability at which a phase transition occurs, and $W$ is a parameter, such as
the average cluster size, being studied. There is no equivalent theoretical
framework for evolving real-world networks that belong to a power-law family.

Consider a graph with $N$ nodes and $E$ edges. If $N(k)$ is the number of
nodes of degree $k$, the associated probability distribution of nodes is given by
$P(k) = N(k)/N$ for $k = 1, \dots, K$, where $K$ is the maximum degree. The average
degree of the graph is

$$\langle k \rangle = \sum_{k=1}^{K} k P(k) = \frac{1}{N} \sum_{k=1}^{K} k N(k) = \frac{2E}{N}.$$

The fraction of edges, $F$, relative to the number of all possible edges in the
complete graph on $N$ nodes, is

$$F = \frac{E}{\binom{N}{2}} = \frac{2E}{N(N-1)} = \frac{\langle k \rangle}{N-1}.$$

We define $W_k$, the likelihood that an edge chosen at random is connected to a
node of degree $k$, as

$$W_k = \frac{k N(k)}{\sum_{k=1}^{K} k N(k)} = \frac{k N(k)}{\langle k \rangle N} = \frac{k}{\langle k \rangle} P(k).$$

Using a term from percolation theory we define the average cluster size

$$W = \sum_{k=1}^{K} W_k N(k).$$

The generalized moments $\lambda_{n,m}$ are given by the expression

$$\lambda_{n,m} = \sum_{k=1}^{K} k^{n} P(k)^{m}.$$

Note that $\lambda_{1,1} = \langle k \rangle$. For networks where $K$ is large, it can
be proved that $W \propto F^{-1}$ (see analytical derivation in Methods).
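The definitions above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' code: the function names are hypothetical, the input is assumed to be a plain list of node degrees, and nodes of degree 0 are counted in $N$ but contribute nothing to the sums, which start at $k = 1$.

```python
from collections import Counter


def degree_counts(degrees):
    """Return {k: N(k)} for k >= 1 (the sums in the text start at k = 1)."""
    return Counter(k for k in degrees if k > 0)


def moment(degrees, n, m):
    """Generalized moment lambda_{n,m} = sum_k k^n P(k)^m, with P(k) = N(k)/N."""
    num_nodes = len(degrees)
    return sum(k ** n * (c / num_nodes) ** m
               for k, c in degree_counts(degrees).items())


def edge_fraction(degrees):
    """F = 2E / (N (N - 1)); the degree sum equals 2E."""
    num_nodes = len(degrees)
    return sum(degrees) / (num_nodes * (num_nodes - 1))


def avg_cluster_size(degrees):
    """W = sum_k W_k N(k), with W_k = k N(k) / sum_j j N(j)."""
    counts = degree_counts(degrees)
    total = sum(k * c for k, c in counts.items())
    return sum(k * c * c for k, c in counts.items()) / total


if __name__ == "__main__":
    star = [3, 1, 1, 1]  # star graph on 4 nodes: one hub, three leaves
    print(edge_fraction(star), avg_cluster_size(star), moment(star, 1, 2))
```

For the four-node star used in the example, $\lambda_{1,1}$ equals the average degree $2E/N = 1.5$, and the product $W \cdot F$ equals $\frac{N}{N-1}\lambda_{1,2}$, which follows directly from the definitions.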

The analysis above demonstrates that for power-law networks that belong
to the same class, W scales as 1/F in the limit of large N. We now demonstrate by
means of simulations that this scaling law holds for power-law networks of finite
size (Fig 1A) but not for Erdős-Rényi random networks (Fig 1B). The scaling of
W as a function of F can be used to discriminate sensitively between types of
networks produced by previously described network growth models by full or
partial node duplication (15). The graphs in Figs 2A and 2B correspond to
networks constructed according to a partial duplication model, which produces
power-law networks for values of $p$ with $p < (\sqrt{5} - 1)/2$ (7, 14, 15, 18). As
p approaches 1, the degree distribution begins to deviate from a power-law
distribution. This is shown by the graphs in Figs 2C and 2D, which show the
progressive deviation from the power law as reflected both in the graphs of
N(k) vs. k and W vs. F on the log-log scale.
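A simulation of the kind described for Fig 1A can be sketched as follows. This is an illustrative reimplementation under stated assumptions, not the authors' code: the names `ba_snapshots` and `w_and_f` are hypothetical, each new node is assumed to attach `m = 2` edges to distinct existing nodes chosen with probability proportional to degree, and the seed graph is a triangle.

```python
import random
from collections import Counter


def ba_snapshots(t_max, snapshot_times, m=2, seed=0):
    """Grow a preferential-attachment graph; return {t: degree list} snapshots."""
    rng = random.Random(seed)
    degree = [2, 2, 2]          # triangle seed graph on nodes 0, 1, 2
    stubs = [0, 1, 1, 2, 2, 0]  # each node listed once per incident edge
    wanted = set(snapshot_times)
    snapshots = {}
    for t in range(1, t_max + 1):
        new = len(degree)
        targets = set()
        while len(targets) < m:
            # A uniform draw from stubs is a degree-proportional draw of a node.
            targets.add(rng.choice(stubs))
        degree.append(0)
        for v in sorted(targets):
            degree[v] += 1
            degree[new] += 1
            stubs.extend((v, new))
        if t in wanted:
            snapshots[t] = list(degree)
    return snapshots


def w_and_f(degrees):
    """Return (W, F) from a degree list, following the definitions in the text."""
    n = len(degrees)
    total = sum(degrees)  # equals 2E
    counts = Counter(k for k in degrees if k > 0)
    w = sum(k * c * c for k, c in counts.items()) / total
    f = total / (n * (n - 1))
    return w, f


if __name__ == "__main__":
    for t, degs in sorted(ba_snapshots(2000, range(100, 2001, 100)).items()):
        w, f = w_and_f(degs)
        print(t, round(w, 2), round(f, 5))
```

Plotting the printed W against F on log-log axes gives the kind of collinearity check described in the Figure 1 legend; the exact values depend on the random seed.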

We have tested for the scaling law described above in data from the
Citeseer database (19, 20). For each year between 1991 and 1999, we
extracted the list of papers published and the number of citations to each paper
and plotted the degree distribution for the citation network for each year (Fig 3A).
The degree distributions appear to fit a power law poorly. In Fig 3B we plot W as
a function of F for these 9 networks; on the log-log scale, W is approximately
a linear function of F with a slope of -0.86 (close to -1 as predicted by our
analysis).
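A slope of this kind can be estimated by an ordinary least-squares fit of log W against log F. The sketch below is one way to do it and is not necessarily the fitting procedure used for Fig 3B; the function name `loglog_slope` is hypothetical, and the numbers in the usage example are synthetic values constructed to satisfy W = 2/F, not data from the Citeseer networks.

```python
import math


def loglog_slope(f_values, w_values):
    """Least-squares slope and intercept of log(W) against log(F)."""
    xs = [math.log(f) for f in f_values]
    ys = [math.log(w) for w in w_values]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


if __name__ == "__main__":
    # Synthetic illustration only: points lying exactly on W = 2 / F.
    f_demo = [0.1, 0.05, 0.02, 0.01]
    w_demo = [2.0 / f for f in f_demo]
    print(loglog_slope(f_demo, w_demo))
```

A fitted slope near -1 is what the inverse scaling predicts; a markedly different slope, or strong curvature in the log-log plot, would suggest that the family of networks does not follow the scaling law.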

The result described above, that the average cluster size W varies as an
inverse function of F, does not discriminate among different types of power-law
networks; rather, it identifies growing networks in this broad class. Here we only
study the behavior of the moment $\lambda_{1,2}$ as a function of the fraction of
possible edges F. The moments that we define in equation 2 are analogous to the
moments of a probability distribution, and may serve as a rigorous method for
characterization of the structure of complex networks and their modes of evolution.
This simple scaling law should aid diverse purposes where network-based simulation
is important, such as finding patterns in social networks and epidemiological
modeling to optimize immunization strategies. Moreover, modularization of large
and complex networks into component subgraphs, each with a power-law
topology, might also be possible.
Figure Legends:

Figure 1. Characteristic parameters that describe the growth features of
power-law networks. Beginning with a fully connected graph on 3 nodes
and growing it according to the preferential attachment scheme of Barabási
and Albert (11) with $m_0 = 2$ and for t = 100, 200, 300, ..., 2000, twenty
networks of increasing size were produced. The degree distributions of
these networks approach a power law in the asymptotic limit (11). For
each of these twenty networks we computed W and F (panel A). This is a
considerable reduction of complexity: where we once had twenty different
degree distributions, each of varying goodness of fit to the ideal power-law
distribution, we now have twenty points whose collinearity in a log-log plot
can be easily checked. In panel B, we simulated 20 Erdős-Rényi random
networks with 100 nodes where F was varied in steps of size 0.01 from
0.4 to 0.6. For each of these random networks we measured the average
cluster size W and plotted it as a function of F.

Figure 2. Characteristic parameters describe network growth models.
Networks were produced using the partial node and edge duplication
model (15) for p = 0.1, 0.2, 0.3, and 0.4 (panels A, C), and for p = 0.7, 0.8,
0.9, and 0.95, respectively (B, D). For p < 0.5, the model is expected to
produce power-law networks, but for p > 0.5 the networks progressively
deviate from the power law.

Figure 3. To test whether the derived property holds for real networks,
citation data of papers published during 1991-1999 were analyzed. For
each year, we built a citation network and plotted the degree distribution,
N(k), as a function of k in panel (A). The degree distributions appear to be
noisy power laws. For each of the 9 networks in panel (A), we computed
W and F, and plotted these on a log-log scale in panel (B), showing that
the growth of these networks is indeed consistent with the power-law family. The
noisy inverse relationship we observe characterizes the networks as those
growing in the same power-law family, but in a noisy manner.



Methods 

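Analytical Derivation: For networks where the maximum degree, $K$, is
large, we can approximate the sum in the definition of average cluster size
by an integral. For a power-law degree distribution $P(k) = C k^{-\beta}$,
normalization requires

$$\int_{1}^{K} P(k)\,dk = C \int_{1}^{K} k^{-\beta}\,dk = C \left( \frac{1 - K^{1-\beta}}{\beta - 1} \right) = 1.$$

The normalization constant, $C$, depends on $\beta$ and $K$ and is given by

$$C = \frac{\beta - 1}{1 - K^{1-\beta}}.$$

Hence, for large $K$ and $\beta > 1$, the limiting value of $C$ is
$C = \beta - 1$. Substituting in the expression for the generalized moment (see
text), we obtain

$$\lambda_{n,m} = \sum_{k=1}^{K} k^{n} P(k)^{m} \approx C^{m} \int_{1}^{K} k^{n - m\beta}\,dk = C^{m}\, \frac{K^{n - m\beta + 1} - 1}{n - m\beta + 1}.$$

Using the normalized value of $C$ we can rewrite the above as

$$\lambda_{n,m} \approx \left( \frac{\beta - 1}{1 - K^{1-\beta}} \right)^{m} \frac{K^{n - m\beta + 1} - 1}{n - m\beta + 1},$$

which simplifies in the limit of large $K$, provided $n - m\beta + 1 < 0$, to

$$\lambda_{n,m}^{\infty} = \lim_{K \to \infty} \lambda_{n,m} = -\frac{(\beta - 1)^{m}}{n - m\beta + 1}.$$

The relationship between the quantities $W$ and $F$ is obtained from the
definition of

$$F = \frac{\langle k \rangle}{N - 1} = \frac{\lambda_{1,1}}{N - 1}$$

and

$$W = \sum_{k=1}^{K} W_k N(k) = \frac{\sum_{k=1}^{K} k N(k)^{2}}{\sum_{k=1}^{K} k N(k)} = \frac{\sum_{k=1}^{K} k P(k)^{2}}{\sum_{k=1}^{K} k P(k)}\, N = \frac{\lambda_{1,2}}{\lambda_{1,1}}\, N.$$

In the case of large networks or in the limit as $K \to \infty$ we obtain

$$W \cdot F = \frac{N}{N - 1}\, \lambda_{1,2}, \quad \text{or} \quad W \cdot F \approx \lambda_{1,2}.$$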
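As a worked check of the derivation above, the limiting value of the generalized moment can be evaluated for $n = 1$, $m = 2$. This is a sketch that assumes $\beta > 1$ (so that $n - m\beta + 1 = 2 - 2\beta < 0$) and relies on the same continuum approximation, integrating from $k = 1$; it is a consequence of the formulas as reconstructed here rather than a statement taken from the source.

```latex
\lambda_{1,2}^{\infty}
  = -\frac{(\beta - 1)^{2}}{1 - 2\beta + 1}
  = \frac{(\beta - 1)^{2}}{2(\beta - 1)}
  = \frac{\beta - 1}{2},
\qquad
W \cdot F \approx \frac{\beta - 1}{2}
\;\Longrightarrow\;
\log W \approx -\log F + \log\frac{\beta - 1}{2}.
```

Under these assumptions the product $W \cdot F$ depends only on the exponent $\beta$, so networks growing within one power-law class fall on a line of slope $-1$ in the log-log plot of $W$ against $F$.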